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The severe acute respiratory syndrome (SARS) coronavirus 
encodes several RNA-processing enzymes that are unusual for 
RNA viruses, including Nspl5 (nonstructural protein 15), a hex- 
americ endoribonuclease that preferentially cleaves 3' of uri¬ 
dines. We solved the structure of a catalytically inactive mutant 
version of Nspl5, which was crystallized as a hexamer. The 
structure contains unreported flexibility in the active site of 
each subunit. Substitutions in the active site residues serine 293 
and proline 343 allowed Nspl5 to cleave at cytidylate, whereas 
mutation of leucine 345 rendered Nspl5 able to cleave at 
purines as well as pyrimidines. Mutations that targeted the res¬ 
idues involved in subunit interactions generally resulted in the 
formation of catalytically inactive monomers. The RNA-bind- 
ing residues were mapped by a method linking reversible cross- 
linking, RNA affinity purification, and peptide fingerprinting. 
Alanine substitution of several residues in the RNA-contacting 
portion of Nspl5 did not affect hexamer formation but 
decreased the affinity of RNA binding and reduced endonucle¬ 
ase activity. This suggests a model for Nspl5 hexamer interac¬ 
tion with RNA. 


The Nidoviruses contain three families of viruses, includ¬ 
ing the Coronaviridae that cause numerous diseases in 
humans (1). Severe acute respiratory syndrome coronavirus 
(SARS-CoV) 3 * is a member of the Coronavirus genus (2, 3). It 
originated from animals but spread to humans, causing 
severe respiratory distress with a fatality rate of —10% (as 
shown by the World Health Organization, www.who.int/csr/ 
sars/country/en/country2003_08_l5.pdf). In addition to their 
medical importance, coronaviruses are of interest for their large 


* This work was supported by National Institutes of Health Grant 
1R56AI06164 and a Small Grant for Exploratory Research from the National 
Science Foundation. The costs of publication of this article were defrayed 
in part by the payment of page charges. This article must therefore be 
hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 
solely to indicate this fact. 

® The on-line version of this article (available at http://www.jbc.org) contains 
supplemental Table SI and supplemental Figs. S1-S3. 

The atomic coordinates and structure factors (code 2RHB) have been deposited in 
the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, 
Rutgers University, New Brunswick, NJ (http://www.rcsb.org/). 

1 Both authors contributed equally to this work. 

2 To whom correspondence should be addressed. Tel.: 979-458-2235; Fax: 

979-845-9274; E-mail: ckao@tamu.edu. 

3 The abbreviations used are: SARS, severe acute respiratory syndrome; CoV, 

coronavirus; WT, wild type; MALDI-ToF, matrix-assisted laser desorption 

ionization time-of-flight. 


—30-kb positive-strand genome and novel mechanisms that have 
evolved to replicate and transcribe this large RNA (5,6). In keeping 
with the novel strategies used, coronaviruses encode several 
unusual RNA-processing enzymes, including an RNA endoribo¬ 
nuclease, an RNA methyltransferase, a second RNA-dependent 
RNA polymerase that generates primers for coronavirus replica¬ 
tion (7), and a mechanism to decrease replication errors (8). 

Coronavirus subgenomic RNAs are particularly interesting 
in that they all have the same 5' leader sequence derived from 
the 5' end of the genomic RNA. This organization requires 
recombination as part of transcription. Various mechanisms 
have been proposed for subgenomic RNA production, but a 
discontinuous transcription mechanism is increasingly favored 
(5). This model proposes that transcription regulatory 
sequences in the minus-strand RNA direct translocation of the 
ternary complex to the 5' leader sequence, where transcription 
resumes. The minus-strand RNAs serves as the template for 
subgenomic RNA transcription. Ribonucleases that process the 
RNA intermediates for transcription have been proposed 
(9-11), but the mechanism for the process is still not com¬ 
pletely understood. 

Nspl5 (nonstructural protein 15) was predicted to be an RNA 
endoribonuclease as part of a bioinformatics analysis of the SARS- 
CoV genome (7). Substitutions of critical residues built into infec¬ 
tious clones of the human Coronavirus resulted in decreased infec- 
tivity and reduced genome RNA levels (12). A more thorough 
analysis of mutations in the Nspl5 ortholog of the related Arteri- 
virus revealed that several residues, including those in the putative 
active site, reduced viral plaque size and decreased viral titers by up 
to five logs (13). Similar results were observed with Nspl5 of 
mouse hepatitis virus A59 (14). In both arterivirus and mouse 
hepatitis virus, reduced subgenomic RNA levels were associ¬ 
ated with mutations in Nspl5. Other properties of virus infec¬ 
tion were also affected. Therefore, it is likely that Nspl5 and its 
orthologs play multiple roles in viral infection (14). 

To better understand how the activity of Nspl5 affects proc¬ 
essing of viral RNAs, Bhardwaj et al. (15) and Ivanov et al. (12) 
independently produced recombinant Nspl5 of the SARS-CoV 
in Escherichia coli and demonstrated endoribonuclease activity 
that preferentially cleaves RNAs at uridylates. This activity was 
stimulated by Mn 2+ but not by other divalent metals such as 
Mg 2+ (15). More recently, Bhardwaj et al. (16) demonstrated 
that Nspl5 cleaves 3' of uridylates through the formation of a 
2'-3' cyclic phosphodiester product. 

Ribonucleases are generally grouped on the basis of whether 
they cleave 5' or 3' of the cognate phosphodiester (17). The 
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former includes RNase H and RNase III, whereas the latter 
includes members of the RNase A family and RNase T1 (18). 
The two types of enzymes also differ in their metal ion depend¬ 
ence. The RNase H and III enzymes are metal-dependent, 
whereas RNase A and T1 are metal-independent (19). Thus, 
Nspl5 does not obviously belong to either group. Its cleavage is 
similar to RNase A, but its activity is enhanced by metal ion. 
This combination of properties is shared by XendoU, an 
endoribonuclease involved in small nucleolar RNA biosynthe¬ 
sis (19). 

An unusual feature of Nspl5 is that it assembles into a hex- 
amer in solution and that disruption of hexamer formation 
reduces endoribonuclease activity (20). Transmission electron 
microscopy analysis showed that Nspl5 consists of a dimer of 
trimers that interact in an end-to-end fashion (16). A low reso¬ 
lution co-crystal of Nspl5 and RNA suggested that the RNA 
binds to the outside of the hexamer and interacts with both 
trimers (16). 

Two Nspl5 structures have been determined. Ricagno et al. 
(21) determined the x-ray structure of the SARS-CoV Nspl5 
protein from crystals with one protomer in an asymmetric sub¬ 
unit. The crystal structure of mouse hepatitis virus Nspl5 was 
solved by Xu et al. (22), also with one subunit in the asymmetric 
unit. When arranged into a hexamer, the six active sites are 
exposed at the surface of the hexameric structure. Further¬ 
more, the active site of Nspl5 can be modeled according to the 
residues of RNase A that share a common mechanism of catal¬ 
ysis (21, 22). In this structure, the C-terminal tail of Nspl5 folds 
back on the active site, suggesting that it may play a role in 
specific recognition of the cognate uridylate. These crystal 
structures, however, lack RNA, and the details of RNA interac¬ 
tion and many requirements of the subunit interactions remain 
unclear. 

Using the coordinates of Ricagno et al. (21), we solved the 
structure of an active site mutant of the SARS-CoV Nspl5 by 
molecular replacement. Notably, this structure has six subunits 
within the asymmetric unit. Predictions of the structure were 
examined, leading to elucidation of the requirements for spec¬ 
ificity in RNA cleavage, subunit interaction, and RNA binding. 

EXPERIMENTAL PROCEDURES 

Crystallization and Data Collection ofNspl5-H234A —Initial 
crystallization trials were performed in sitting drops using 15 
mg/ml of the Nspl5 mutant, H234A. The Hydra 96 robotic 
system (Robbins Scientific Co.) was used for Hampton and 
Wizard (Emerald Biostructures) crystallization screens. Three 
different conditions, all containing 2-methyl-2,4,-pentanediol 
as the precipitant, were judged promising for further optimiza¬ 
tion by the hanging drop vapor diffusion method of crystalliza¬ 
tion. Diffraction quality crystals were obtained at 10-15 mg/ml 
of H234A in 0.05-0.1 m MgCl 2 , 3-5% 2-methyl-2,4,-pen- 
tanediol, and 0.1 m tri-sodium citrate dehydrate (pH 5.6-5.9). 
The best crystal diffracted up to 2.64 A at the synchrotron beam 
line 14-ID of Advance Photon Source, Chicago (Table 1). The 
crystal was mounted in cryoloop and flash-cooled in a liquid N 2 
stream after a brief soak in 30% ethylene glycol. The diffraction 
data set was indexed with unambiguous solution and processed 
using HKL2000 (23). 


TABLE 1 

Data collection and refinement statistics 


Data collection 

Cell dimensions (A) 

Space group 

Number of molecules per asymmetric 
unit (Z) 

Resolution (A) 

Completeness (%) a 

tW 0/ °) 4 

I/sigma(I) 

a = b = 305.757 and c = 88.74 

R3 

6 

50-2.8 

92.0 (78.2) 

6.5 (37.2) 

17.5 (2.6) 

Refinement statistics 

Resolution (A) 

50-2.8 

Number of reflections in working 

66521 

data set 

Number of reflections in the test 

3507 

data set 

Rc,ysA%Y 

19.7 

« fr ee(%) 

25.3 

Ramachandran statistics 

Most favored (%) 

90 

Allowed (%) 

10 

Root mean square deviation 

From ideal bond lengths (A) 

0.015 

From ideal bond angle (°) 

0.8 


a The values in parenthesis represent values for the highest resolution shell. 

'Xym = ShSJ/hj - < 4 >|/ 2 h 2 l 4 il ,where 4 i i is the ith observation of the reflection 
h, whereas </ h > is the means intensity of reflection h. 

C Hryst = 2|f„l - |fJ/|fol- «ftee was calculated with a fraction (5%) of randomly 
selected reflections excluded from refinement. 

Structure Determination and Refinement —The crystal unit 
cell indicated the presence of six molecules in the R3 space 
group per asymmetric unit. The positions of each molecule 
were solved by molecular replacement using the program 
Phaser v.1.3 (24) with the previously published catalytically 
active structure of Nspl5 (Protein Data Bank code 2H85) (21) 
as the search model. Model building was accomplished using 
Xtalview (25) to examine the 2 F a — F c , F a — F c , and composite 
omit electron density maps and refinement using maximum 
likelihood restrained refinement method in CCP4-REFMAC 
(26, 27) were performed. After the model reached R factors of 
less than 30%, water molecules were added to the structure 
using automated water picking in Xtalview. Medium or tight 
noncrystallographic averaging increased the R factors, and 
therefore only loose noncrystallographic averaging restraints 
were applied during the REFMAC refinement. The final struc¬ 
ture has an R factor of 19.7% and an R {lee value of 25.3% with 
good stereochemistry as analyzed by PROCHECK (28). The 
final refinement statistics are in Table 1. The figures were pre¬ 
pared using Spock ( quorum.tamu.edu ) and Chimera (30). The 
atomic coordinates of the final model have been deposited in 
the Protein Data Bank with the code 2RHB. 

Molecular Docking —Dock 6.0 and its accessory programs 
(31) were used to carry out the flexible docking of 3'-UMP or 
CMP (from Protein Data Bank code 4RSK-RNASE A:3'UMP 
complex structure) into the Nspl5 crystal structure (Protein 
Data Bank code 2H85), which was kept rigid. DMS (32), a soft¬ 
ware that computes molecular surface, was used to create the 
molecular surface of the receptor. The negative image of the 
binding site was defined using Sphgen (31) within the 10 A 
radius of the catalytic site residues His 234 , His 249 , Ser 293 , Lys 289 , 
and Tyr 342 to adopt the sphere-matching algorithm of Dock 
6.0. An incremental construction (Anchor-and-Grow method) 
was used to allow for flexibility of the ligand. Automatic match- 
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ing mode was used with the 20 configurations/ligand building 
cycle. Interaction between the ligand and the catalytic site was 
evaluated by the grid score (33, 34) followed by visual 
inspection. 

Constructing the Nspl5-S293A Model —The atomic coordi¬ 
nates of Nspl5 crystal structure (Protein Data Bank code 2H85) 
were used to construct a model of Nspl5-S293A using Modeler 
(Insight II, www.accelrys.com). Residues within a 10 A radius of 
A293 were subjected to a medium level simulated annealing 
optimization with respect to the variable target function of 
Modeler. 

Site-directed Mutagenesis and Protein Purification —Amino 
acid substitutions were introduced in the Nspl5 expression 
plasmid using the QuikChange site-directed mutagenesis pro¬ 
tocol as recommended by Stratagene. The entire open reading 
frame was sequenced to confirm the presence of directed muta¬ 
tion and absence of unintended mutations. WT and mutant 
proteins with a His tag at their N termini were expressed in 
E. coli and purified by metal ion affinity chromatography, fol¬ 
lowed by Mono Q ion exchange chromatography as described 
previously (15). The purified proteins were stored in 50 mM Tris 
(pH 7.9), 300 mM NaCl, 1 mM dithiothreitol, 50% (v/v) glycerol 
at —20 °C. The protein concentrations were determined by 
absorbance at 280 nm. 

Size Exclusion Chromatography —WT or mutant protein 
samples (—100 pg) were filtered through a Superdex 200 col¬ 
umn using a Pharmacia fast protein liquid chromatography sys¬ 
tem calibrated with the following molecular mass standards 
(kDa in parentheses): blue dextran (2000); thyroglobulin (660); 
ferritin (440); catalase (232); aldolase (140); bovine serum albu¬ 
min (67); and chymotrypsinogen (25) according to the protocol 
described in Guarino et al. (20). 

Reversible Protein-RNA Cross-linking and Peptide Mapping — 
H249A (10 jum) was incubated with or without 20 pM biotiny¬ 
lated RNA, Bio-rUCIO in 20 mM Hepes (pH 7.5), 5 mM MnCl 2 , 
and 1 mM dithiothreitol at room temperature for 5 min fol¬ 
lowed by the addition of formaldehyde to a final concentration 
of 0.1% in a 25-p\ reaction. The cross-linking reaction was 
stopped by the addition of glycine at 0.2 m final concentration. 
After 5 min, sequencing grade trypsin (Trypsin Gold; Promega, 
Madison, WI) was added at a protease:substrate ratio 1:50 
(w/w) in 100 mM NH 4 HC0 3 (pH 7.8) at 37 °C and allowed to 
digest overnight. Streptavidin magnetic beads (New England 
Biolab, Beverly, MA) were used to capture biotinylated RNA 
and RNA-peptide conjugates as described previously by Kim et 
al. (4). RNA-peptide conjugates were reversed by incubating 
the samples at 70 °C for 1 h. The samples were centrifuged at 
3000 X g for 5 min, and the supernatants containing the pep¬ 
tides were desalted using a Ziptip (Millipore, Bedford, MA). 
The bound peptides were eluted in 2.5 p\ of 70% acetonitrile 
and 0.1% trifluoroacetic acid. The eluted samples were analyzed 
by MALDI-ToF. 

Affinity Chromatography —10 pg of WT or mutant proteins 
were incubated on ice with —2 mg polyU-agarose in buffer A 
(50 mM Tris, pH 7.5, 5% glycerol, 1 mM j3-mercaptoethanol, 10 
mM MnCl 2 ) containing 30 mM NaCl. After 90 min, the agarose 
beads were washed by centrifugation at 6,000 rpm for 10 min to 
remove unbound protein. Bound protein was eluted with buffer 


A containing increasing concentrations of 50, 100, 200, 300, 
400, and 500 mM NaCl. The fractions were analyzed by SDS- 
polyacrylamide gel electrophoresis followed by staining with 
Coomassie Brilliant Blue. Protein was quantitated using the 
Coomassie Plus Protein Assay kit from Pierce. 

Endoribonuclease Assays —The real time endoribonuclease 
assay used a substrate purchased from Integrated DNA Tech¬ 
nologies, Inc. (Coralville, Iowa). The substrate has a 5' carboxyl 
fluorescein fluorophore at the 5' end and tetramethylrhoda- 
mine at the 3' end, which quenches 5' carboxyl fluorescein 
fluorescence. Of the four nucleotides, only the cognate cleavage 
nucleotide is a ribonucleotide, whereas the others are deoxyri- 
bonucleotides. Protein and RNA substrate were incubated in 20 
mM Hepes (pH 7.5), 5 mM MnCl 2 ,1 mM dithiothreitol, as indi¬ 
cated. Substrate cleavage was monitored with an excitation 
wavelength of 492 nm and an emission wavelength of 518 nm 
(16). Changes in fluorescence were measured over time using 
an LS55 spectrometer (PerkinElmer Life Sciences). 

RESULTS AND DISCUSSION 

The Crystal Structure of Nspl5 Mutant EI234A —A mutant 
version of Nspl5 protein with an alanine substitution for one of 
the active site histidines, H234A, was deemed suitable for crys¬ 
tallization because it could be expressed and purified at a level 
more than 10-fold higher levels than the WT Nspl5. Further¬ 
more, it, like WT Nspl5, purified as a hexamer (20) and was 
competent for RNA binding (16). 

A single crystal of H234A protein, hereafter named Nspl5 H , 
diffracted up to 2.64 A with unit cell parameters a = b = 
305.757 and c = 88.740 in the R3 space group. The asymmetric 
unit contained six molecules, and the positions of each subunit 
of Nspl5 H were solved by molecular replacement using the 
structure of the catalytically active Nspl5 V291G-D300G dou¬ 
ble mutant (Protein Data Bank code 2H85) (21) as the search 
model. The Ricagno et al. (21) model, hereafter named 
Nspl5 VD has only one molecule/asymmetric unit. The asym¬ 
metric unit of Nspl5 H provides additional insight into the het¬ 
erogeneity among individual subunits of the hexamer and 
allows comparison with the WT active site in Nspl5 VD . 

The overall structure of Nspl5 H is very similar to that of 
Nspl5 VD , with a root mean square deviation difference of 0.53- 
0.73 A (344Ca atoms of each subunit of Nspl5 H were super¬ 
imposed on Nspl5 VD structure using the LSQKAB routine of 
CCP4 (26)). Both Nspl5 H and Nspl5 VD consist of a dimer of 
trimers, which is consistent with the solution state and with 
single Nspl5 particles analyzed by transmission electron 
microscopy and cryoelectron microscopy (16). 

Each subunit contains nine a-helices and twenty-one 
j3-strands. A subunit is further organized into three domains: 
an N-terminal domain (residues 1- 61), a middle domain (res¬ 
idues 62-181), and a C-terminal domain (residues 182-345) 
(Fig. L4). Alanine substitutions of highly conserved residues 
have demonstrated that the C-terminal domain contains the 
active site (20), which faces away from the center of the hex¬ 
amer and contains the extreme C-terminal residues, consistent 
with the observations of Ricagno etal. (21). When viewed from 
the top of the hexamer, a pore through the trimer is evident, 
with the N-terminal domains of the trimer lining the bottom of 
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FIGURE 1. Structure of SARS-CoV Nspl 5. A, surface representation of a subunit showing three domains. Active site residues are colored by element and 
indicated by an arrow. B, arrangement of six subunits from top view of hexamer. N-terminal, middle, and C-terminal domains are indicated as N, M, and C, 
respectively. The arrows indicate the positions of catalytic sites in top trimer. C, side view of hexamer showing arrangement and N-to N-terminal interaction of 
the top trimer (77) with bottom trimer (72). Six subunits are labeled as o-cand are colored as follows:T1 a, pink;T\ b, blue; Tic, cyan; T2a, red;T2b, green; T2c, 
golden. This color scheme is used throughout the figures. Catalytic residues are colored yellow. D, worm diagram drawn based on b-factor. Worm thickness is 
directly proportional to flexibility, i.e. the thickest region indicates most flexible. E, overlap of the catalytic residues within the active sites of the six subunits. The 
structures are anchored by the backbond of His 249 . 


the pore (Fig. 15). This pore has an inner diameter of ~12 A and 
does not interact with the substrate RNA according to the cryo- 
electron microscopy structure (16). The middle and the N-ter¬ 
minal domains have extensive contacts with the other subunits 
of the hexamer. These interactions are evident in the side view 
of the Nspl5 hexamer (Fig. 1C). 

Heterogeneity in the Six Subunits in Nspl5 —There are signif¬ 
icant differences among the subunits of Nspl5 H that are not 
observed in Nspl5 VD . To better describe these differences, the 
three subunits of the upper trimer, as oriented in Fig. 1 ( B-D ), 
are labeled Tla, Tlb, and Tic, and the lower trimer subunits are 
labeled T2a, T2b, and T2c. The T2 subunits have B factors of 
49.3, whereas the T1 subunits have B factors of 69.7 A 2 , respec¬ 
tively. The increased flexibility is shown by the worm diagram 
in Fig. ID. In addition, flexibility is significantly higher in the 
C-terminal domains of each subunit than for the rest of that 
subunit (Fig. ID). Within subunits T1 and T2, the root mean 
square deviation difference in the N-terminal domain is 0.27- 
0.34 A, whereas the C-terminal domain had a root mean square 
deviation difference of 0.51-0.57 A. The crystal packing may 
have some effect on the observed variations. This is insufficient, 


however, to affect the entire domain of each subunit. Moreover, 
the trimers showed significant differences between each other 
despite being in a similar environment. 

Heterogeneity is most obvious in the subunit catalytic pock¬ 
ets formed by residues His 234 , His 249 , Trp 332 , Tyr 342 , and 
Lys 289 . Superposition of the active site residues showed the 
degree of conformational flexibility between the subunits (Fig. 
ID). Although the positions of Lys 289 and His 234 are highly sim¬ 
ilar in each of the subunits, Tyr 342 is especially flexible and 
adopted different rotomer conformations in the different sub¬ 
units (Fig. IE). In addition, Trp 332 is disordered in subunit T2c 
and not visible in the electron density map. The distance 
between the side chain oxygen atom of Tyr 342 to el-nitrogen 
atom of Trp 332 varied between 3.0 and 4.1 A in Nspl5 H . 

Subunit Interactions —Based on the examination of a panel of 
mutants, Guarino et al. (16, 20) proposed that hexamer forma¬ 
tion is a prerequisite for enzymatic activity and RNA binding. A 
key mutant in this analysis was E3A, located in the N-terminal 
domain of Nspl5. It was inactive at lower concentrations but 
recovered some catalytic function at higher concentrations that 
should favor oligomerization (20). Structural data suggests that 
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FIGURE 2. Subunit interaction. A, wire diagram of subunit T1 a and its interaction with other subunits. Contact regions are boxed and labeled (boxes B-E). The 
inset shows molecular arrangement and atomic distances between E3 and L2 (left) and the gel filtration profile of the E3A mutant of Nsp15 (20). H, T, and M 
denote positions corresponding to elution volume of a hexamer, trimer, and monomer of Nspl 5, respectively. B-E, contacting residues and the calculated 
distances between them (left). Atomic distances are calculated using Chimera. Gel filtration chromatography elution profiles of the indicated mutant proteins 
(right). The values below the mutant name refer to the cleavage rate relative to WT in parentheses. 
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E3 and the adjacent L2 residue of one subunit are involved in 
trimer-trimer interactions with L2 and E3 of a subunit in a 
different trimer (Fig. 2A, inset). This interlocking structure thus 
provides a reasonable explanation for the effects of the E3 
mutation on Nspl5 oligomerization (20). Furthermore, because 
oligomerization is required for activity, it is important to examine 
oligomerization in greater detail. 

The crystal structure predicts multiple interactions that 
should help stabilize the Nspl5 hexamer (Fig. 2 A). We exam¬ 
ined four of the contacting surfaces, named B through E, by 
making mutations and examining their effects on Nspl5 struc¬ 
ture and function. In Fig. 2 A the bottom side of the T1 trimer 
(Tla, Tib, and Tic) is shown interacting with only one subunit 
of the T2 trimer (T2b). Contacting surfaces B, C, and D make 
contacts within a trimer, whereas E contains interactions 
between trimers. Supplemental Table SI provides the distances 
involved in these subunit contacts. The interactions are pre¬ 
dicted to be similar for both Nspl5 VD and Nspl5 H . However, 
Arg 61 of Nspl5 H salt bridges with Glu 266 (contact point D; sup¬ 
plemental Table SI). Side chain atoms of all of these residues 
are well ordered. 

In interaction surface B, the backbone oxygen atom of resi¬ 
due Asp 39 of subunit Tla forms an ionic bond with the Hl-ni- 
trogen atom of Arg 90 in Tic (Fig. 2 B). In addition, the S2-oxy- 
gen atom of Asp 39 forms a H-bond with the yl-oxygen atom of 
Thr 48 in Tic. To test the functional relevance of these interac¬ 
tions, we analyzed hexamer formation and enzymatic activity of 


the mutant versions D39A and R90A. Oligomerization state 
was analyzed using a rapid and reproducible gel filtration assay. 
The validity of this assay has been previously confirmed by ana¬ 
lytical centrifugation assays and also a native gel analysis ( 20 ). 
Both D39A and R90A eluted in a gel filtration column as mono¬ 
mers (Fig. 2 B), confirming the importance of these residues in 
hexamer formation. 

A real time fluorescent assay previously described in Bhard- 
waj et al. (16) was used to monitor endoribonuclease activity. 
The substrate contains three deoxyribonucleotides, one cleav- 
able ribonucleotide (rU), a 5' fluorophore, and a 3' quencher. 
Cleavage of the substrate results in an increase in fluorescence, 
which can be measured in real time. Functional analysis of 
D39A and R90A, using the real time endoribonuclease assay, 
revealed that the activity of these mutants was less than 1 % of 
the WT protein. 

In surface C, the backbone oxygen atom of N163 in Tib 
interacts with the ^-nitrogen atom of I<264 in Tla. In addition, 
the S2-nitrogen atom of N163 in Tib bonds the 72 -oxygen 
atom of T281 in Tla (Fig. 20). Alanine substitution of Asn 163 
should disrupt the latter interaction and resulted in a nearly 
equal distribution between the monomer and the hexamer, 
suggesting that hexamer formation was affected but not abol¬ 
ished. In the real time endoribonuclease activity assay, N163A 
retained 29% of the activity of the WT protein. 

In surface D, the Hl-nitrogen atom of Arg 61 in subunit Tla 
interacts with the el-oxygen atom of Glu 266 in Tic. Gel filtra- 
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FIGURE 3. Analysis of the Nspl 5 catalytic site. A, molecular docking of UMP into the Nspl 5 active site. The 
backbone of active site residues is shown in gray, and that of UTP is in yellow, nitrogen and oxygen atoms are 
blue and red, respectively. The phosphorus is cyan. Atomic interaction is denoted by broken lines, and the 
calculated distances are indicated. B, results of cleavage of substrate rU with WT and mutant Nspl 5, using real 
time fluorescent endoribonuclease assay. The identity of protein and rate of reaction is indicated to the right of 
the graph. The endoribonuclease activity was tested at 0.05 p,M of WT or mutant proteins. C, Gel filtration of 
Nspl 5. Elution profiles for WT Nspl 5 and mutant versions as indicated next to the curve. The expected posi¬ 
tions for hexamer (77), trimer (71, and monomer (M) are shown in each panel. D, SDS-PAGE analysis of polyU- 
agarose affinity chromatography fractions. WT or mutant Nspl 5 proteins were incubated with polyU-agarose 
for 90 min at 30 mM NaCI. Unbound fraction was removed by a low speed centrifugation, and the slurry was 
washed sequentially with increasing NaCI concentration. Samples of each NaCI concentration wash were 
analyzed by SDS-PAGE and Coomassie Blue staining. 


tion analysis of the mutant protein E266A showed that it eluted 
primarily as a monomer. Furthermore its endoribonuclease 
activity was less than 0.4% of WT (Fig. 2D). With the three sets 
of mutants that affected intratrimer interactions, it is interest¬ 
ing to note that only mutant N163A retained some ability to 
form hexamers and also retained some endoribonuclease 
activity. 

Interactions within surface E highlight the importance of 
intertrimer interaction. One such interaction is between the 
Sl-oxygen and 81-nitrogen atoms of Asn 52 in Tla with back¬ 
bone nitrogen atom of lie 26 and oxygen atom of Val 24 , 
respectively, in T2b. When mutant N52A or I26A,I27A were 
tested, both proteins existed primarily as monomers. Again, 
both mutants were severely debilitated for endoribonuclease 
activity, with cleavage rate of rU being less than 1% of WT 
(Fig. 2 E). 

Interestingly, mutants E3A, N52A, and 126,27A, which were 
predicted to form trimers because these residues are involved in 
trimer-trimer interactions, all eluted as monomers and not tri¬ 
mers as we expected (Fig. 2, A, inset, and E). These results sug¬ 
gest that a trimer is not a stable intermediate in the formation of 
hexamers and that the equilibrium favors monomers and hex¬ 
amers. Some trimer (<10%) was seen previously in gel filtration 
columns (20). It is possible that these reflect a dissociation 
event. 


Functional Analysis of Active Site 
Residues —An interesting aspect of 
the Nspl5 active site is that the 
C-terminal tail contributes to its 
formation. To obtain an initial 
model for substrate recognition at 
the Nspl5 active site, we performed 
a molecular docking analysis with 
UMP. The results suggested that 
some of the active site residues 
might contribute to specific interac¬ 
tions with U and help discriminate 
against a C (Fig. 3 A). First, Ser 293 
could potentially FI-bond with the 
N 3 imino hydrogen and the C 2 car¬ 
bonyl oxygen of the uracil. This 
information is consistent with the 
proposal of Ricagno et al. (21), who 
postulated that Ser 293 was compara¬ 
ble in spatial location to Thr 45 , the 
residue in RNase A that hydrogen 
bonds to the base and specifies 
cleavage. Second, the phenyl side 
chain of Tyr 342 could stack against 
the uracil, resulting in Pro 343 and 
Feu 345 occupying the space above 
the C4 carbonyl of uracil. In addi¬ 
tion, three basic residues, Fys 289 , 
His 234 , and His 249 that have been 
previously identified by bioinfor¬ 
matics analysis to participate in 
catalysis (7) could form an ionic sur¬ 
face that interacts with the ribose 
portion of the uridylate and the adjacent phosphate. Consistent 
with their proposed roles in catalysis, these three residues are 
positioned to act on the ribose 2' OH and the adjacent phos- 
phodiester group (Fig. 3 A). It is known that mutations in any of 
these three residues reduced RNA cleavage to background (20). 

To examine whether and how residues in the catalytic site, 
including the C-terminal tail, affect endoribonuclease activity, 
we made mutant versions of Nspl5 that changed Ser 293 and 
Tyr 342 to alanines to disrupt H-bond or stacking interactions, 
which these residues might have with substrate. Pro 343 and 
Feu 345 were mutated to glycines to widen the catalytic pocket. 
Endoribonuclease analyses with the rU substrate revealed that 
the WT protein had a cleavage rate of —1.5 jtrniol . The rate 
for mutant S293A was —0.25 p,mol _1 , P343G and F345G 
cleaved the substrate at 0.02 and 0.055 pmoU 1 respectively, 
whereas the activity of Tyr 342 was undetectable (Fig. 3 B). These 
results confirm that the residues identified in the catalytic 
pocket are involved in catalysis. 

Based on the structure, we did not expect oligomerization to 
be affected by the changes in proteins S293A, Y342A, P343G, 
and F345G. To test this directly, however, we examined the 
oligomerization state of these proteins by gel filtration. WT 
Nspl5 and all four of these mutants eluted in a position corre¬ 
sponding to hexamers (—240 kDa) (Fig. 3C), confirming that 
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FIGURE 4. Examination of substrate specificity. A, UMP docked into catalytic pocket of WT Nspl 5. 6, sche¬ 
matic to denote uridine recognition by S293 and the approximate distances between UMP and nearby cata¬ 
lytic site residues. C, results of real time fluorescent endoribonuclease assay. 0.25 ju.m of WT or mutant S293A 
proteins were used in these assays, and cleavage of substrates rU, rC, rA, and rG was measured in real time. The 
results of rU and rC cleavage are shown in real time. The slopes of each curve represent the rate of reaction. The 
identities of protein and substrate along with rate of reaction are indicated to the right of the curves. Cleavage 
rates of rA and rG are shown in the inset. D, cleavage of fluorescent substrates by WT or L345G proteins. Each 
protein (0.25 /am) was incubated with one of the four fluorogenic substrates (rU, rC, rA,or rG)for20 min at room 
temperature and measured the fluorescence for each treatment. The increased cleavage of the rA substrate by 
L345G was confirmed in kinetic assay in two other independent experiments (data not shown). 


the C-terminal tail does not affect oligomerization but is 
required for endoribonuclease activity. 

To determine whether these catalytic site mutants were 
affected for interactions with the RNA substrate, we performed 
affinity chromatography using polyU-agarose resin. To prevent 
RNA cleavage, the chromatography was performed with a spe¬ 
cific active site inhibitor that decreased Nspl5 endoribonucle¬ 
ase activity to background (to be published). Fig. 3 D shows 
SDS-polyacrylamide gel electrophoretic analysis of protein 
fractions eluting at different salt concentrations. WT Nspl5 
eluted from polyU agarose at 200 and 500 mM NaCl (Fig. 3D). 
The elution profiles of mutants S293A and P343G were similar 
to WT, whereas mutants Y342A and L345G eluted at lower salt 
(Fig. 3D). These results show that RNA binding was affected by 
alterations in some of the active residues. Among this panel of 
mutants, Tyr 342 showed the most dramatic effect on binding 
and was also the most affected with respect to activity. This 
does not always correlate, however, because H249A is catalyti- 
cally inactive and demonstrates WT binding to polyU-agarose 
(data not shown). 

Specificity for Uridylate —Molecular docking of UMP into 
the Nspl5 active site (Fig. 4A) suggests that N 3 and O 2 of the 
uridylate could form H-bonds with "/-oxygen atom and back¬ 
bone nitrogen atoms, respectively, of Ser 293 , whereas O 4 could 
interact with either Leu 345 or Pro 343 via Fl-bonds (Fig. 4S). This 
possibility is supported by our earlier results showing that mod¬ 
ification of either N 3 or O 4 of uridylate affected cleavage by 


Nspl5 (16). Cytosine should not 
form equally strong interactions 
with the active site residues. 
Although the O 2 of cytosine could 
interact with backbone nitrogen 
atom of Ser 293 , the y-oxygen atom 
of Ser 293 would need to reorient to 
form a second H-bond with N 3 of 
cytosine. Additionally, the bulkier 
NFI 2 group at C 4 could sterically 
hinder the fit of cytosine in the 
pocket (Fig. 4 B, inset). 

Molecular modeling and energy 
minimization of mutant S293A with 
CMP or UMP indicate that in 
absence of Ser 293 , Pro 343 could 
interact with either O 4 of uridylate 
or the C 4 amino group of cytosine to 
stabilize binding (supplemental Fig. 
SLA). In addition, the side chains of 
Lys 289 , Leu 248 , and Leu 345 would 
adjust to better complement the 
electronics of a cytosine in the active 
site (supplemental Fig. SIB). The 
S293A mutation should also result 
in a decreased anionic nature of the 
active site cleft and a narrowing of 
the active site cavity, which could 
accommodate a uridylate or a cyti- 
dylate equally well, although with a 
lower affinity as compared with the 
affinity of WT protein to a uridylate. Taken together, our obser¬ 
vations from the models is that uridylate should be a preferred 
substrate over purines or a cytidylate because it can form stron¬ 
ger specific interactions and has a better steric fit in the catalytic 
pocket. 

To test these predictions, first we analyzed the effect of muta¬ 
tions that would potentially alter the interaction between uri¬ 
dylate and the catalytic residue Ser 293 . In addition to the already 
characterized S293A, we changed Ser 293 to asparagine (S293N) 
or threonine (S293T). All three mutant proteins eluted in a gel 
filtration column as hexamers (data not shown). To examine 
the specificity for RNA cleavage with greater sensitivity, we 
used the fluorescent real time assay and with 0.25 /am of S293A 
or WT Nspl5, a concentration that is 5-fold higher than our 
typical assay. We also compared the normal rU substrate and a 
similar substrate having cytidylate as the only ribonucleotide 
(named rC), as well as substrates with guanosine (rG) and aden¬ 
osine (rA). WT reproducibly cleaved rU with a rate of 4.8 
qinoU 1 and rC with a rate of 0.285 qmoU 1 (Fig. 4C). The ratio 
of rU to rC cleavage, or the specificity factor, is 16.8 for WT. 
There was no cleavage by the WT protein with rG or rA (Fig. 4 D 
and data not shown). 

Mutant S293N was severely decreased in activity, with rates 
of cleavage for rU and rC at less than 0.017 qmol -1 S293T 
cleaved rU and rC with rates of 4.6 and 0.28 /xmoU \ respec¬ 
tively, thus generating a specificity factor of 16.4, which is com¬ 
parable with that of WT (supplemental Fig. S2). The fact that 
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S293T is not affected in endoribonuclease activity is not sur¬ 
prising given that the comparable residue in the mouse hepati¬ 
tis virus Nspl5 is a threonine, and the infectivity of EAV was not 
significantly affected either residue at the comparable position 
in its endoribonuclease ortholog (13). 

S293A had a more interesting activity profile. It had reduced 
cleavage rate, with rU being cleaved at 2.35 pmoP 1 (Fig. 4C). 
Unlike WT enzyme, however, S293A cleaved rU and rC with 
comparable rates (2.55 and 2.35 /xmol -1 ), resulting in a speci¬ 
ficity factor of 1.1. Furthermore, S293A has comparable K m 
values for rU (26.7 + 8.5 /tm) and rC (28 + 2.6 p, m), values 
similar to that of WT Nspl5 for rU (30.3 ± 9.5 /cm), hike WT, 
S293A did not cleave rA or rG, thus discrimination against 
purine substrates was not altered. These results support our 
molecular docking predictions and the idea proposed by Ric- 
agno et al., (21). These modeling results suggest that Ser 293 
participates in specific recognition of the substrate by Nspl5 
and further extends the similarities in the cleavage mechanism 
for Nspl5 and RNase A. 

Next, we analyzed the mutants P343G and F345G. We pre¬ 
dicted that these substitutions could potentially widen the 
pocket and/or increase flexibility in the active site. P343G was 
reduced in catalytic activity to 1.3% of WT and had a specificity 
factor of 10.4 ± 2 for rU/rC. The more interesting result was 
with F345G. Feucine at this position should sterically block the 
binding of a purine in the WT protein (Fig. 4A). Substitution of 
glycine at this position decreased overall cleavage activity and 
resulted in a lower rU/rC specificity factor of 2.3 ± 0.2 relative 
to WT. Interestingly, F345G cleaved purine substrates at a sig¬ 
nificantly higher rate than WT or other catalytic mutants (Fig. 
4D). Because leucine 345 is the penultimate residue, these 
results suggest that the C terminus of Nspl5 contributes to the 
specific recognition of uridylates in part by preventing the 
access of other nucleosides into the active site. 

Nspl5-RNA Interactions —Images of Nspl5 in 2D crystals in 
complex with RNA indicates that an RNA of 16 nucleotides 
binds across the interface between two trimers (16). Further¬ 
more, only hexamers bind RNA in solution, indicating that 
RNA binding follows oligomerization (16). The catalytic pock¬ 
ets are surface-exposed in both Nspl5 VD and Nspl5 H crystal 
structures. These pockets, however, are too narrow in both 
structures to accommodate an RNA molecule, and a conforma¬ 
tional change is necessary for RNA binding. The flexibility 
observed in this region may be a part of the mechanism to allow 
RNA docking into the catalytic pocket. 

Three different RNA access paths for the catalytic site of 
Nspl5 hexamer were identified based on surface complemen¬ 
tarity and were finalized by manual docking of a single-stranded 
RNA of sixteen uridylates (built using the biopolymer module 
of Insight II (Accelrys Inc.)). These dockings were done with 
some steric clashes that needed to be resolved by slight confor¬ 
mational changes in the protein. The three potential paths for 
RNA binding are shown in Fig. 5 A. To examine whether any or 
all of these potential paths contact RNA, we used a method that 
does not contain built-in bias in selecting RNA-contacting res¬ 
idues, a reversible cross-linking peptide fingerprinting method 
that was previously used to identify the template-binding por¬ 
tions of the hepatitis C virus RNA-dependent RNA polymerase 


(4). Briefly, a biotinylated RNA is cross-linked to a protein by 
use of formaldehyde. The reaction is then treated with trypsin 
to digest the protein after lysines and arginines, and the RNA 
and RNA-peptide complexes are enriched by streptavidin affin¬ 
ity chromatography. After reversing the cross-links, the pep¬ 
tides are purified from the RNA and subjected to MALDI-ToF 
spectroscopy. The peptides are identified from peptides gener¬ 
ated by a virtual cleavage. 

The biotinylated RNA used in the reversible cross-linking 
peptide fingerprinting contained multiple uridylates to facili¬ 
tate binding by Nspl5 (16) and several cytidylates that contain 
amines that can be cross-linked with formaldehyde. The cata- 
lytically inactive mutant H249A was used in these experiments 
to maximize the production of RNA-protein complexes by 
avoiding catalytic cleavage of the substrate. A typical mass spec¬ 
trum for the H249A-RNA cross-linked sample is shown in the 
lower panel of supplemental Fig. S3, whereas the result from a 
control reaction performed in the absence of RNA is in the 
upper panel. After subtracting the peaks corresponding to 
streptavidin and/or trypsin, eight ions that could be attributed 
to H249A were identified (Fig. 5 B). Peptides containing over¬ 
lapping sequences to these eight were also observed when the 
samples were treated with chymotrypsin (data not shown), pro¬ 
viding a confirmation for their assignment. 

When the peptides were mapped back to the Nspl5 struc¬ 
ture, the pattern shown in Fig. 5C was observed. Notably, the 
affinity-purified peptides border the predicted paths identified 
by bioinformatics analysis. In addition, peptides corresponding 
to the active site were found. The location of the peptides, how¬ 
ever, did not allow us to identify a single path, suggesting either 
that all three are used at some level or that this analysis is insuf¬ 
ficient to discriminate among the possibilities. 

To obtain a better resolution of the RNA paths, we used the 
locations of the RNA-interacting peptides in the crystal struc¬ 
ture and sequence conservation to guide the design of addi¬ 
tional amino acid substitutions. This analysis required assump¬ 
tions of the more likely residues based on the probability of 
particular side chain atoms to participate in formaldehyde 
cross-linking as well as orientation of the side chains within the 
Nspl5 hexamer structure. Formaldehyde preferentially cross¬ 
links residues that have basic, aromatic, or sulfur-containing 
side chains (29). Based on the orientation of the side chains, 
alanine substitutions of residues His 18 , Asn 73 , Asn 82 , Asn 83 , 
Arg 181 , Phe 213 , and Fys 256 were constructed. In addition, based 
on their conservation and presence in the affinity purified tryp¬ 
tic peptides, alanine substitutions at Asn 28 , Asn 29 , and Thr 47 
were also included in the mutational analysis. S293A, P343G, 
Y342A, and F345G were previously made in our analysis of the 
active site, and they are known to have effects on RNA binding, 
respectively (Fig. 3D). Because of their proximity, Asn 28 and 
Asn 29 were mutated together, as were Asn 82 and Asn 83 . 
Because hexamer formation is a prerequisite for RNA binding, 
all of the mutants were examined for RNA binding and were 
found to be hexameric (data not shown). 

Next, we used polyU affinity resin to compare the stability of 
binding of mutant Nspl5 proteins with that of WT. Mutants 
K181A, F213A, and N82A,N83A bound to polyU resin at levels 
comparable with WT, with the peak of the proteins eluting at 
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FIGURE 5. A Model for a complex between Nsp15 and oligonucleotide RNA. A putative RNA binding path of sNsp15 hexamer. For clarity, only subunitTI A 
is emphasized. A U16 A-RNA oligonucleotide is manually docked through the narrow catalytic site of the enzyme. 6, summary of Nsp15 peptides obtained by 
biotin-streptavidin affinity chromatography and reversible cross-linking and trypsin digestion. The peptides were purified using Zip-tip as recommended by 
the manufacturer and subjected to MALDI-ToF. The expected masses obtained by virtual tryptic digestion of Nsp15 are denoted by (e) and the masses of 
observed peptide peaks from MALDI-ToF analysis are denoted by (0). The peptide position on the protein sequence is indicated by protein sequence as well as 
residue numbers. C, location of peptides identified by reversible RNA cross-linking and MALDI-ToF analysis. Identified peptides are colored pink. Potential RNA 
binding residues, along trajectories selected for mutational analysis are colored red. Active site residues are colored by element. D, affinity chromatography 
analysis of Nsp15 mutant and wild type proteins. All of the proteins were allowed to bind to polyU-agarose at 30 mM sodium chloride concentration followed 
by washing the slurry with sodium chloride concentrations indicated on top. Protein in each fraction was quantified by the Coomassie Plus protein assay kit 
from Pierce and is indicated as a percentage of the amount loaded, flavored RNA binding path on Nspl 5. Color scheme is as follows: protein subunits, shades 
of gray; active site residues, yellow; residues that affected affinity of protein for RNA, red; residues that did not affect RNA binding, green. A green line indicating 
RNA path is drawn on protein surface. 


400 mM NaCl (Fig. 5 D). Mutants H18A, T47A, N28/29A, N73A, 
and K256A were significantly decreased for RNA binding, with 
the peak of the proteins eluting at 200 mM NaCl or lower, sug¬ 
gesting that these residues directly or indirectly contribute to 
RNA binding. When located onto the Nspl 5 structure, the 
effects of the mutants indicate that Path III is most likely 
involved in stable binding to RNA (Fig. 5, A and E ). 


Final Comments —SARS-CoV Nspl5 displays a mechanism 
of endoribonuclease action that is identical to that of RNase A 
and likely uses a general acid-base mechanism for catalysis. 
RNase A is known to employ two histidines and a lysine for 
catalysis. Functional studies have demonstrated that two histi¬ 
dines (His 234 and His 249 ) and one lysine (Lys 289 ) are important 
catalytic residues for Nspl5 as well. Our structure and molec- 
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ular docking with UMP confirms the previous thesis of Ricagno 
et al. (21) that the catalytic pocket of Nspl5 bears a striking 
resemblance to that RNase A. Furthermore, we have function¬ 
ally elucidated the relevance of a number of residues in the 
Nspl5 hexamer that are important for specific recognition of 
uridine and discrimination against purines and cytidine. We 
also identified several residues that stabilize interaction with 
the substrate. Although the catalytic center is formed by the C 
terminus, residues from N as well as C termini were found to 
help stabilize protein-substrate interactions. Identified RNA- 
binding residues that are located on the “side” of the Nspl5 
hexamer supports our earlier proposal based on cryoelectron 
microscopy analyses and further identifies a path that feeds the 
RNA into active site. The information now available on the 
Nspl5 active site, the RNA binding path, and the contact sites 
required for oligomerization should be of interest in the design 
of Nspl5-specific inhibitors. 
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